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2 

Ashis Sen Gupta 
Stanford University 


1. Introduction. 

Let X be a p-dimensional random vector variable with dispersion 
matrix I . Two well-known measures of multidimensional scatter, obtained 
by generalizing the variance, the univariate measure, are E and the 
generalized variance (GV), J E J = det(E), introduced by Wilks (1932 ,1967). For 
multivariate normal populations, Likelihood Ratio Tests (LRT's) for E’s, 
of course of same dimensionalities, and some optimum properties of these 
tests are known. But, when multidimensional scatter of populations of 
different dimensions need to be compared, these tests cannot be defined. 
However, using j E | , which we will nomenclature as Standardized Gen¬ 

eralized Variance (SGV), such comparisons become meaningful. Since |E| 
represents the volume in p-dimensions, note that j 5T [ ^^becomes a mea¬ 
sure so scaled as to become comparable with scatter for a scalar random 
variable. Apart from this generality, need for tests of generalized vari¬ 
ances has been also felt, on its own right. |E| , being a scalar, is 
more suitable and easier to work with than the matrix E . Hoel (1937) was 

1 This research was partially supported by National Science Foundation 
Grant SES79-13976 at the Institute for Mathematical Studies in the 
Social Sciences, Stanford University. 
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Part of this paper was presented by the author, as an invited speaker, 
at the Conference on Applications of Numerical Analysis and Special 
Functions in Statistics, University of Maryland, October 6, 1980. 
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probably first to realize this need and later Eaton (1967) studied 
some problems of statistical Inference associated with a single GV. 

Blckel (1965) and others have used "generalized variance efficiency" for 

comparing various test procedures. The GV has been extensively used in 
applied research, e.g., by Goodman (1966) in Agricultural Statistics, 
Gnanadesikan and Gupta (1970) in Ranking and Selection, Arvanitis and 
Afonja (1971) in Sample Survey, Kiefer and Studden (1976) in The Theory 
of Optimal Designs, etc. While the estimation, e.g., van der Vaart (1965), 
Shorrock and Zidek (1976), and the distribution, e.g., 

Mathai (1972) have been studied In some detail, little seems to be 
known about tests for GV's. This paper attempts to bridge that gap. 

Suppose X^'s are independently distributed as N p (U^*^) > ^ 

being general dispersion matrices, i = l,...,k . LRT's are derived for 

1/p i 2 1/p i 

Hqi : |E ± | - a Q (given) > 0, for some fixed i; H Q2 : [ij 


1/p, 

|Dji , for some fixed i and j and finally 


1/p. 

]E i | all equal. 


i » l,...,k against appropriate two-sided alternatives. The test cri¬ 
teria turn out to be quite elegant multivariate analogues to those in 
the univariate cases. 

The solutions to the distributional problems associated with the 
various test statistics considered above need extensive use of Special 
Functions. The exact distributions for both the null and nonnull cases 
are presented for most of the above test criteria. The percentage points 
of these distributions can be obtained from existing mathematical tables 
since the distributions are represented in suitable computable forms. 
Examples of construction of tables and the general procedure of obtain¬ 
ing them for such computable forms of the distributions exist in current 
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literature, e.g., Mathai and Katiyar (1979). Further, many of the existing table 
can also be exploited to give the percentage points. Large sample approxi¬ 
mations to the above distributions are also presented. 

Finally, considering general E^s equal dimensions, for H Q1 
and Hq 2 » it is shown that the "modified" LRT's are unbiased—a result 
parallel to Sugiura and Nagao (1968) on tests of covariance matrices. 

2. Applications. 

In addition to the mathematically interesting nature of the problem 
and the applications cited above, there lies a rich, fertile area for 
numerous applications of the SGV's. In fact, wherever variance is 
employed for univariate situations, SGV's seem to be applicable for the multi¬ 
variate situations of 'overall' variability. Some examples are cited below. 

(a) Multivariate Quality Control . It is well known [e.g., see 
Steyn (1978)] that testing H^: the population mean vector y^_ of 

X , X^ ~ N^U^.E), remains constant during the sampling process against 

the alternative that y r varies during the process, is equivalent to 

testing H^: GV * |E] against GV * )E*| , where E* = (I+2D/n)E, 

D * £ y y' E , n being the sample size. One of the many applications 

r=l ~ r ~ r 

of this result and, hence, test for SGV can be seen in multivariate quality 
control. 

(b) Generalized Canonical Variable(GCV) Analysis . When the p-corponcnt 
original vector can be divided into k > 2 mutually exclusive groups, 

Anderson (1958), Problem 5, pp. 305-306, proposed GCV's to be obtained by 
minimizing their GV. Steel (1951) and Kettenring (1971)(as in his Ph.D. thesis) 
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have attempted to construct such GCV's. Sen Gupta (1980a) constructed 
GCV's which are equicorrelated and formulated and studied some inference 
problems associated with such GCV's. However, no results on statistical 
inference associated with the GCV's obtained by Anderson-Kettenring- 
Steel's met ho’ are available. If the original vector can be meaning¬ 
fully subgrouped into < k£ < ... < k g groups, then one is faced with 
the problem of choosing between canonical variables with dimensions 


k^, i * l,...,s . For the k^-th subgrouping, the original vector is 

split into k. mutually exclusive and exhaustive classes, the j-th 
1 k t 

class consisting of p. elements, £ p. = p, i = l,...,s . Obviously, 

J j=l 

since the criterion of minimum GV is being used to represent the original 


vector by a smaller dimensional one, a GCV with the smallest dimension 
k^ , will be preferred provided they all have the same comparable GV's, 
i.e., the same SGV's. Also, it may be meaningful to represent the ori¬ 
ginal vector by different GCV's of the same dimensions, k , but whose 


components are derived from different subgroupings of the original vector. 


Gnanadesikan (1977), pp. 74-77, drawing from a well-known example in 


psychometry, considers three (=k) sets of scores by several people on 


three batteries of three tests each, i. e., p^ = = p^ = 3 . Ttie three tests 

in each battery were intended to measure, respectively, the verbal, the 


numerical, and the spatial abilities of the persons tested. He commented, 
"An interesting alternative analysis in this example (...) would be to 
regroup the nine variables into three sets corresponding to the three 
abilities measured rather than the three batteries of tests." If the prob¬ 
lem is considered in the more general set-up that the observed values 
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merely constitute a sample from the underlying population, then how 
profitable the suggested alternative analysis will be must be judged 
through related statistical tests of significance. Furthermore, similar 
problems for a combination of the cases considered above, i.e., where 
one is faced with the problem of comparing several GCV's, some of which 
may possibly differ in dimensions, are quite important from practical 
considerations. Such statistical problems as comparing GCV's of possibly 
different dimensions obtained by Anderson-Kettenring-Steel's method can 
thus be formulated as tests for SGV's. 

(c) Generalized Homogeneity of Multidimensional Scatter . Dyer and 

Keating (1980) were interested in the homogeneity of variances of sealed 
bids of five Texas offshore oil and gas leases. The variances were com¬ 
puted from the biddings of the oil companies, whose numbers and identities 
varied from lease to lease. In similar situations, if repeated observa¬ 
tions are available for the leases or if the variabilities (for the 
bidding patterns) for certain companies for repeated bids over possibly 
different numbers of leases need to be compared, then one is faced with 
the problem of testing homogeneity of "variances," which we will term 
"Generalized Homogeneity" for vector variables of possibly different 
dimensions. This will be equivalent to testing homogeneity of SGV's, 
which is precisely defined above. 

(d) Missing Observations on a Random Vector . In cases of missing 
observations on part of a random vector [Dahiya and Korwar (1980), Goodman 
(1968)], generalized homogeneity as considered above can be quite useful 
for comparing multidimensional scatter. Suppose for an agricultural product 
P, there are q important characteristics with regard to variability. 
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Let the complete observation vector be X = (X^,...,X r ), X^: * 1, 

r 

i ■ 1.r, £ q ■ q. Let observations be available from population 

1 1 

j on only tj £ r of the X^'s, say, with no loss of generality, on 
X * (X 1 ,...,X ), j * l,...,k. Since usually for an agricultural 

j t i 

product P, the overall variability of the product can be represented 
in terms of the volume, one can use GV as a measure of this overall 
variability. In dealing with populations of maize and cotton, this 
notion was reflected by Goodman in his concluding remark: "The six 
cotton populations indicate even more clearly that the generalized 
variance is a useful measure of overall variability that merits futher 
investigation." Let it be desired to compare the multidimensional 
scatter of P over the k populations. Due to missing observations 


one possibility is to consider the random vector, say X , u - Min t. 

j J 

for which observations are available from all the populations and 
compare the GV's of X^. This was the approach of Goodman's analysis 
where "Only plants for which complete data were available were used for 
these analyses." This approach however results in loss of data. The 


other and better alternative, in case the missing observations are in 

the format described above would be to compare the SGVs of the X 

r J 

[If measurements on the components are in different units, we can consider 

the standardized scatter coefficient, where scatter coefficient, defined 

by Frisch (1929), is the determinant of the correlation matrix]. This 

seems reasonable since more characteristics related to the variability 

* 

of P is considered through X 's. 

3 

(e) Ranking and Selection . The method of ranking and selection 
based on GVs as proposed by Gnanadesikan and Gupta (1970) for 
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equi-dimensional vector variables has a natural extension to an initial 
test for SGVs for vector variables differing in dimension followed by 
ranking and selection procedures based on the SGVs. 

(f) Cluster Analysis . Often in cluster analysis, after the clusters 
are identified the question of homogeneity of clusters is of vital impor¬ 
tance, as in Mezzich and Solomon (1980). Tests for SGVs can be exploited 
advantageously for such circumstances. 

3. Likelihood Ratio Tests for SGV's . 

Let X 'V Np(y,E). Throughout our discussion, unless otherwise stated 
we will assume £ to be non-singular. Denote the population SGV of 
X, |l|^ p by A 2 and that of the sample, js/N|^ p by d 2 where S 
is the sample sums of products matrix based on a sample of size N. Also 
denote |s|^ p by s 2 . [Note that, Anderson (1958) defines GV with the 
divisor N-l instead of N]. 


3.1. Test for a Specified Value of SGV . Let x^,...,x N be a random 

2 2 

sample from N^(y,I) and suppose we want to test H^: A * (specified) 
2 2 

against H^: A 5* °q • (Note that Hq is equivalent to the hypothesis 
that the GV, |s| , has the specified value a 2p ). Since the Hq does not 
strain y, we have y * x. To find the MLE of 2 under H Q , consider 


* - in C + [f In 0 t - +| Jin s 2p - Jin 8 ± - Jin p 2p , 

where C ■ (211) Np/,2 |s| N ^ 2 , X Is the Lagrange undetermined multiplier, 
9^* i ■ 1, ... , p are the characteristic roots of 

and we have used the fact that |z|^ p ■ p 2 is equivalent to 

8 P “ \ " *> n ^q P * s 2p ■ |S| . Differentiating 4> w.r.t. 0^ 

and equating to zero, we have N - X - 0 , i - l,...,p. So, 


con- 
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(N-X) p - s 2p /cr 2p ♦9 1 - s 2 /a 2 , i*l,...,p . 

Hence, 

L/L » (La N ^ 2 exp(-p/2 a*^ P ) * f(a), say, where = (e/N) np ^ 2 and a»s 2p /a 2p . 
co Q 

However, 

f(a) t a < N p and 4- a > N p . 

So, we get, 

2 2 2 2 

Result 1 . The LRT for H Q s A - 0 Q against H^ A J a Q can be 
equivalently given by. 

Reject H Q iff d 2p /a 2p >a Q or < a 1 , 

where a^ and a^ are constants to be determined from the specified level of 
the test. 


3.2. Test for the Equality of the SGVs of Two Independent Multi¬ 
variate Normal Populations . Let x y , u - 1,...,N^ and y y , v = 1,...,N2 

denote two Independent random samples from N p (y^.Zj), i ■ 1,2 respec- 

2^2 22 
tively. We are interested in testing Hq: A^ “ A^ against H^: A^ 1* A 2 » 

2 

where A^, i ■ 1,2 are the population SGVs of X and Y respectively. 

As in Section 3.1, y^ - x, y^ ■ y. Let 0^, i ■ l,...,p^ and ri , 
j - l,...,p 2 be the characteristic roots of X^S^ and re 8 P ec ~ 

tively where S^, i ■ 1,2 are the sample sums of products matrices for 
X and Y respectively. For finding the MLEs of E^s under H^, consider 
as before. 
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where C is a constant, A is the Lagrange undetermined multiplier, 

2p ... 1/ Pl . , 1/P 2 

* = |s |, i * 1,2 and we have used the fact that |EjJ = [ E 2 I 
is equivalent to the quantity within the third bracket on the right-hand 
side of (3.1) being zero. 

Differentiating w.r.t. 0^ and n^s and equating these to zeros, 
we have 


' 2 Pl 2 Pl ] 1/p l 

** A * p^0^ $ 1* 1) • • •)p^ *^ ”* ^ S /a 0 


P2^2 ^ * 3 * • >p2 P2^2 ^ ”* ^2 s 2 ^0 


2p, 2p 2 | 1/ f 2 



These give 0^ - (pjS* + P2 S 2^^ P 1 N 1 + P 2 N 2^ * ^ p l N l d l + p 2 N 2 d 2 ^ P 1 N 1 + P 2 N 2 
2 

where d^ , i = 1,2 are the sample generalized variances for 
X and Y respectively and - A 2 ~ a 2 , the common unknown value. Note 
that Og, the MLE of 0 * agrees with that in the case of P x “ P 2 “ 1 * 

Also 0Q Pl and a* 2 give the MLEs of |Z 1 | and j^j, the generalized 
variances of X and Y respectively. Using these estimates, the Likelihood 
Ratio criterion is given by. 



(3.2.1) 


l/a ‘ H 


2 ) Vl /2 (s 2 ) N 2 P 2 /2 l j(a^) (N 1 P 1 +N 2 P 2 )/2 


C. a 


N 2 P 2 /2 


(P 1 a+P 2 ) 


(N lPl +N 2 p 2 )/2 


= f(a), say, where a 


2 2 

s^/s 2 and C is a constant. 


However, f(a) + a > N 2 p 2 and + 0 < a < 0 N 2 p 2 / N i p i • Tllus 

Result 2 . The LRT for H^: A 2 = A 2 against H^: A 2 4 A 2 can be 
equivalently given by. 


2 /j2 


Reject H q iff R * d 1 ^ d 2 < or > r 


where r^ and r 2 are constants to be determined from the specified 
level of the test. 


3.3. Test for the Equality of SGVs of k(> 2) Independent Multi¬ 
variate Normal Populations . Let ^ * 1,...,N , i = l,...,k denote 

k random samples from k independent populations , i » 1,..., 

respectively. We are interested in testing Hq: A^ , i » l,...,k 
all equal, against the alternative H^, that at least one of them differ. 


k 


As before, under both Hq and , jj ■ , i “ l,...,k . Let 

, i • 1.k, j ■ l,...,p^ be the characteristic roots of 

respectively where S^, i ■ l,...,k are the sample sums of products 
matrices for , i » l,...,k respectively. For finding the MLEs of 

, i • l,...,k , under Hq , consider as before, 
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(3.3.1) 


k Pi 


*•0+1 l 

1-1 j-1 1 


1 1 
-4 in e.. - 4 e. 


ij 2 Ijj 


1=1 


lx , 

' 

" 1 2 
Z — Jin 8 - Jin a . 


2 ii+1 

b 

j P t ij i 



Z —— 2-n 9 . -Jin 

[j *1+1 1+lj 1+1 


where C Is a constant and X are undetermined Lagrange multipliers 

with k+1 being replaced by 1 in the suffixes. Differentiating <J> 
with respect to an< * equating to zeros we have. 


p i K i +<x n+r\-ii ) ■ »i e ij ; 1 * 1 . k - J " 1 . p i • x oi ■ V 


* v ■* e y - s l /5 o • 1 ■ 1 . k ■ 


*2 2 2 
where is the MLE of (T Q , the common unknown value of A^ , i=l,...,k. 

So 


5 0 E p i N i + S 0 I J 1 <X ii+l* X i-li )1 “ Ep i S i 


S 0 " E Pi s i /E p i N i ‘ 

.i. i 


Note again this agrees with the MLE for 0^ of the univariate case. 
Hence, we get. 


Result 3 . The LRT for H^: A^ , all equal, against H^: at least 
2 


one of the A^ , i - l,...,k , differ is given by 

k 

n 

i=l 


k N p /2 

Reject Hq if and only if , n - II (d^/a^) 11 < 
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r 



where Dq is a constant to be determined from the specified level of 
the test. 


4. Exact Null and Non-Null Distributions of the Test Criteria. 


4.1.1. Definitions and Decisions . In order to obtain the exact 
distributions under null and alternative hypotheses, for the test 
statistics considered in Section 3, we recall the following definitions 
and preliminary discussions from Mathai [(1970), (1972), (1973)]. 

Meijer's G-Function. The G-function is defined as. 


(4.1.1) 


G m ’ n \z ai ”“’ a p = (2 tt i) —1 h(s) z" S ds , 

p,q t. b 1 ,...,b] Jl 


where i - (-l) , z is not equal to zero and 


2 

z = exp{s (£n|z) + i arg z)} 


in which £n |zj denotes the natural logarithm of jz| and arg z is 
not necessarily the principal value, and 


n ra q p 

h(s) - n r (i — a — s) n r(b +s)/{ n r(i-b -s) ir r(a, + s)} 

j»l j=l ^ j=m+l J j-n+1 ^ 


where m, n, p, q are integers such that, 0 S n S P , l^.m<_q 


a^(j ■ l,...,p) , b^ (j * 1» • • • »9) 


are complex numbers such that 






(b h + v) ^ (a - 1 - r) , for v, r = 0,1,..., 


L is a contour in the complex s-plane such that the points, 


(4.1.2) 


-s = (bj+v) , j - 1,... ,m ; v = 0,1,... 


and 


-s “ j = 1 . n 5 V “ °* 1 —* 

are separated and the points in (4.1.2) are enclosed by L . An empty 
product is interpreted as unity. The function (4.1.1) makes sense in 
the following cases 

For every z 1* 0 if q - p is positive and for 0 < |z| <1 
if q - p = 0 . The existence of different contours L is discussed 
in Erdelyi (1953, p. 207). 

The Gauss-Legendre Multiplication Formula . 

(4.1.3) T(mz) •= (27T) (1-m)/2 m m2-1 /2 II T(z + j/m) , m - 1,2,... . 

j-0 

This formula (4.1.3) enables one to write a Ganma of an integral 
multiple of z in terms of Gamma of z. 

The technique of inverse Mellin transform can be advantageously 
employed when a moment sequence uniquely determines a distribution. 
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Rao (1973, p.106) has given a number of sufficient conditions for the 


unique existence of the density function. If y s - 0,1,... 

is the (s-l)th moment about the origin of T, uniquely determining 
its density, then the density g(t) of T is given by the inverse 
Mellin transform 


(4.1.4) 


g(t) = (2 tt i) 


C+ioo 

C+ioo 


U 


s-1 


-s 

t 


ds 


where i = /^T and C is a suitably chosen real number. For multi¬ 
variate normal populations it is seen that at least one of the condi¬ 
tions cited in Rao is satisfied by the moment sequence. 

A computable representation of the G-function of (4.1.1) using the 
Calculus of Residues is given by. 


(4.1.5) 


s m,n 

p>q 


a i. a P 

i h^,...,b^ 


n 


. n m. -1 

t r r jrr-v 


= l l A + I I l l R 

k=l j=l J r*=l j»l v— 1 i»0 


where A^ is given in (5.21) and R vi is given in (5.27) of Mathai 
(1970, p. 141). 

Plncherle's H-Function . The G-function is a special case of the 
most general Special Function, namely the H-function. Braaksma (1964) 
has discussed the H-function in detail. Following Mathai, slight modi¬ 
fications of the original definition are made here in order to represent 
it as an inverse Mellin transform. We define the H-function as follows. 
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(4.1.6) 


_ n " |(a ,a) 
H(z) - H® >n z 1 1 P P 

p,q L '(b l ,B 1 ),...,(b q ,3 q )J 


m 


{. 


n ra>,+ fls) n r(i-a.-a A s) 


n»l 


J "j 


J=L 


j “j 


2tt 1 j q p 

n r(i-b -m) n r(a + a ,s) 
j-m+1 3 3 j*=n+l 3 3 


“ 8 j 

z ds 


where 1 * /~1 , p, q, m, n are integers such that 1 £ m ^ q, 0£n_<p 
(j * l,...,p) , g (j ■ l,...,q) are positive numbers and 
a^ (j * l,...,p) , b (j ■ l,...,q) are complex numbers such that. 


Oj(b n +v) ^ 0 h (aj-l-X) , for V, X» 0,1.. h « l,...,m ; j » 1.n 


L is a contour separating the points. 


(4.1.7) -s*> (b^+v)/^ , j - l,...,m; v - 0,1,, 


and 


(4.1.8) -s - (aj-l-v)/aj , j - l,...,n ; v - 0,1,... . 


One condition of existence of the H-function is that it exists, for 
every z ^ 0 if u > 0, where 


V " l “ l “j 

1 3 i J 


and for 0 < |z| < g ^ , if y-0 where 
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1 


P a, q -0, 

g = rr a J n g J . 

i J i j 

The result due to Braaksma [p. 278, (6.1)] which effectively 

says that H(z) is available as the sum of the residues of h(s) 

_s 

z in the points (4.1.7) is not affected by the modification mentioned 
earlier of the original definition of the H-function. A computable 
representation of the H-function, with the detailed method of identify¬ 
ing the poles is given in Mathai (1973) as follows. 


m 

(4.1.9) H(z) -11*. 

j-1 J 


where the second sum is over . defined in (4.10) and 

J 1 J m 

defined in (4.22) of Mathai (1973). 


R. 

3 


is 


4.2. Exact Distribution of d^/ 0 Q P . Since the sample generalized 
variance d^ p , arise frequently in many multivariate tests, various 
authors, e.g. Mathai (1970), Consul (1964), have worked 
on its distribution. Mathai's work seems to be quite suitable in view 
of the distribution that will be needed for Section 4.3. Letting 

n r\ 

v* d P /0 q P and using (4.1.4), the density of v, g(v), under 
Hq , can be written as 


(4.2.1) 

where 


g(v) - C(2wi)' 


n r[(N-j)/2+h]v ds 
j-1 


C G P, ”(v|(N-j)/2 , j = 1,2,...,p) , 0 < v < 


c" 1 - 5j_ r[(N-j)/2] 
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1 


A computable representation of g^(v) is then obtained by substituting 
m * p , n ■ 0 , p = 0 , q « p in (4.1.5) 

Explicit evaluations for the residues needed in (4.1.5) are given 
in Mathai (1972). For this, Mathai considered a random variable with its 
h-th moment given by, 

(4.2.2) E(x h ) = (-) n IL ( s rAL/ , 2 ± h J. n r il n . iA)/ 2 - h J 

a i-1 r[(n-i)/2] i-1 r[(m-i)/2] 


Assume the moment sequence in (4.2.2) uniquely determines the density of 
X. Then, through the use of Calculus of Residues he obtained the 
following 

Theorem Ml . The density function f(x) corresponding to the moment 
sequence in (4.2.2) is as follows. 


f(x) = C(ax/8) m/2 (p+1)/2 x -1 l (R +R') , x > 0 . 

j=l 3 3 

a.-l . 

1 , j a -1 a 

R. * [a -l)!]" i (ax/8 ) 3 l ( 3 ) (-log ax/3) J 

J J r »'o 


a.-1-r 


r-1 


- * _ , (r-l-r.) 1 1 r-1 (r -l-r 2 ) 

x ( l ( r -3 ) X A 1 l ( r )A 1 }B » 

r^O r l r 2 -0 r 2 

Rj is Rj with (ax/8)^ , a^ , A^ , B are replaced by (otx/fj)^ *^ 2 , 
b^ , A'(t) and B' respectively. The various quantities for the two 
cases p-even and p-odd are given in pp. 166-167 of Mathai (1972). 

For v , the moment sequence given in (4.2.2) determines the 

- 0 , 


density uniquely according to Rao (1973, p. 106). Letting p' 





a ■ 1 , 3 = 2 P ]E]/°q P and m = n , the density of v is deduced in 
a form suitable for computation. Using the specified values for |z| 
under the null (3 = 2 P ) and alternative hypotheses, the corresponding 
distributions of v are obtained. 

4.3. Exact Distribution of R . Because of its frequent applications 
in various multivariate test procedures, distribution of the ratio of 
two independent sample GVs have received much attention, e.g. Nandi 
(1977), Tretter and Walster (1975), Mathai (1972) etc. 

Case 1. = p^. The test statistic R will be equivalent to the 

test statistic based on the ratio of two independent GVs, say, w, only 

when p^ = p 2 * p, say. In that case, the density of w = R P is 

2 2 

obtained from Theorem Ml above by letting a = 1/A^ and 3 * l/Ai . 

7 2 2 

Under Hq a/3 = 1 . Under any alternative H: A^/ 05 6 , the 

2 2 2 

distribution of R is obtained by substituting a/3 = A 2 /A^ “ 1/6 
in Theorem Ml. 

Case 2. p^ 1 * p^. In the case of unequal dimensions, the distribution 
of R is not available. We obtain the distribution in terms of the 
H-function and present it in a computable form through the use of Calculus 
of Residues. Now, 

h P 1 p 2 

( 4 . 3 . 1 ) E(R h ) * c (a?/a?) n r{|(N.i)} n r4(N.-—-i)} 

• 12 1 2 1 1 22 P 2 

where 

k p 2 

c - [ji r((N -i)/2} n r((N.-i)/2}] . 

1 1 
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Hence from (4.1.4), on using (4.1.6), the density of R, g 2 (r), under 


Hq , can be written as 


(4.3.2) g 2 (r) = (2iri) 


U 


E(R h ) r“ h dh 


- C, • H 


| »Pl) » * * » » ) 

r 1 1 p 2’ p l 


P 1’ P 2 

P 2 ,P 1 L l (b 1 ,p 2 ),...,(b 


P 1 ,P 2 


, 0 < r < 


Since here the a^s and the ft^s in (4.1.6) corresponding to (4.3.1) 
are all rational numbers, we can use (4.1.3) to express g 2 (r) in terms 
of G-function also. However, use of H-function here is a more direct 
and convenient approach. 

In order to present (4.3.2) in a computable form we introduce the 
following notation. Consider (4.1.6). The poles of r(bj+£jS) are 
given by the equation 


(4.3.3) 


-s => (bj+v)/B , v = 0,1,... . 


If the point -s =* (b^+v^)Bj for some V = e {0,1,...} coincides 
with the poles coming from y-1 other Gammas of the set r(b^+BjS), 
j * l,...,m then the point gives a pole of order y . In order to 
distinguish poles of all orders, for a fixed j , consider the equations 


(4.3.4) 


, w <i» b+v ( J 2 > 

1 Jl-J, 2 Jl-J. 


b,+v 


b +V 


(jm) 


m J i • •*J. 






m 
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We interpret (4.3.4) as follows. For a fixed j , j =0 or 1 for 

r 

r - 1.m . If j = 0 , then (b^+vp r ^ . )/S T . is to be excluded 


,(jj) 


r J 1 * * ’3 r 

l m 


from (4.3.4) v' JJ/ . is a value of v in (4.3.3). For every fixed 
3 l - " 3 m 

j , +•••+ j e {l,...,m} denotes the order of the pole at 

-s ■ (b.+vf 33 ^ )/(?.. To avoid duplication we will use the convention, 

3 3 1* * * 3 m 3 

while considering the points corresponding to (4.3.3), that 


3 1 = 3 2 = = 3 j-l = °* 

Denote by S^ 33 ^ . the set of all values v.^ 33 ^. takes for given 

3 1 ‘‘* J m 3 1‘’‘ 3 m 


(jj) (jj) 


3 1 ,3 2**‘’ ,3 m’ i>e - S j, 


{v. . } . Then 

3 1 * ‘ ' 3 m 

i m 


m 


(4.3.5) 


H(z) = l l 


j=i s: j3) . 3 

j.... j 

l m 


—«s 

where R is the residue of h(s)z at the pole - s»(b + v. . )/$. 
J 

^ i 1 ^ 1 m 

,, , v denotes the summation over all sets S. w ~ . "'''.notes 

s (jj) _ 3 1* * * 3 ^ 

3 1. . . 3 n 


(jj) 




the summation over all sets S 


(jj) 

j - ... j 
J 1 J m 


. More detailed discussions can be 


found in Mathai (1973). 

We now determine the poles and their corresponding orders needed 
in (4.3.5), i.e. only the first product of gammas in (4.3.1). It will 
thus be convenient, for computational purposes, to choose p^ < p£. 

N.-i . 

For a fixed i , the poles of T[—r— + —] » T are given by the 

z i 

equation 


-s - p 1 ((N 1 -i)/2 +v) , v - 0,1,2,... 
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Note that poles of and coincide only when i and j are both 

even or both odd. 

Lemma 4,3.1 . The poles, with their corresponding orders, are given by 
Case A. odd. 

N -1 

(4.3.6) {v^oiq. .. ioi^ * f0,l,2,...} » (v) . Poles are p^-j;— + v ) » 

p l+! 

repeated —^— times . 


{v 0CK^0101 • • 101 } = {0} * Pole is ’ 

j-1 

Pl + ^- A _1 

repeated —--times, j * 3,5,...,p 


1-1 


1 " 


N, -2 


^0101* •-010* = {O* 1 * 2 ****} = {^* Poles are P x (~—+v), 

P l -1 

repeated —— times . 


N i-J 


{v 0 j .^010..010 } “ {0} ' Pole is p l ( ~2 } » 

j- 1 

P l”^ 1-2 

repeated —^-— 2 ~ times , j * 4,6,...,p^-l 


Case B. p^ even. 


N,-l 


(4.3.7) { v iJJo---10 } “ (0,1,2,...) - (v). Poles are Pl (-~+v) , 


repeated — times . 
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{v o^oio..io } " {vl * Pole is PiHr 0 * 

p i j _i 

repeated — - times, j * 3,5,...,p^-l . 

( 22 ) V 2 

^ V 0101-•*01^ * {0,1,2,...} * {v} . Poles are p 1 (—~—+v) , 


repeated ■1 

2 


times 


{v oo».i..oi I ' {oK Pole ls • 

P 1 i-2 

repeated — - times, j * 4,6,, 


■ • 


For i j j , is vacuous unless i and j are both odd or 

both even. (We omit the subscripts of since it is clear what they 

are.) 


ft 1 

(4.3.8) (v v s = {v + ~ 2 ~) « v ■ 0,1,2,...;£ > 1 is odd . Poles are 
identified with those of . 

{v< 2£) > * (v + ^y^} , v * 0,1,2,...;®. > 2 is even. Poles are 

( 22 ) 

identified with those of v ' . 

{v< £ “ — 2 ~ , i > t' > 2 ; t't both odd or both even. 

Poles are identified with those of . 

Proof . The above results follow from the following observations. 
Consider Case A. Let i » 1 and j £ be any odd number. Then, 
poles of r L and r coincide, as 

N r 1 N i"J i+i 

Pl (-y- + V) - Pl (-±— + »■+ (v,A) - {(0, .... } 
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But this set excludes the poles coming from 
Consider j,j' both odd, 3 £ j < j’ £ . 


N,-j N -j • 

(4.3.9) Pl (-±— + A) = p^-^y- + A’) . For 


A e (0,1,..., - l) 

Then 



= E. 
J 


Thus considering the 'excluded sets' E^s we note that the smallest 
element, i.e. 0 , is repeated in all succeeding , through the 
relation (4.3.9). This establishes (4.3.6). A similar argument holds 
for (4.3.7). (4.3.8) follows from the definitions of the corresponding 

sets. 


Theorem 4.3.1 . The probability density function of R is given by 


for p^ odd, 


I 00 

, (r) = C £ (r/6 2 ) 

•[v=0 


V 1 -1 

P 1 (~- + v) (pyD/2 1 


l f (r/6 ; u.s^Aq.Bq) 

u=0 


Pi +1 

. 2 P 1 <N 1 -j)/2 J—V-l 

+ l (r/6 2 ) r " ,r2 


j 


l f(r/6 ; u,a ,A 0 ,B 0 ) 

u=0 J 


N r 2 -i 

» P x (—2~+v) (p r l)/2 

+ l (r/6 2 ) l f (r/6 2 ; u,b ,A ,B ) 

V-0 u =0 


+ W6 2 ) 

j 


Pl ( N r j )/2 1^.-1 


l f(r/6 ; u,b ,A 0 ,B q ) 

u**0 J 
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and for p^ even 


g 2 (r) 


Nr 1 P X 

Pl ( 2~ +v) ~1" 1 

C l (r/6 2 ) l f (r/6 2 ; u.a^.A ,B ) 

v=0 u=0 


+ l* (r/6 2 ) 


p i ( N r j)/2 


l f(r/6 ; u,a’ A ,B q ) 

u=0 J 


N -2 p 

Pl (-|- + v) -i-1 

+ l (r/6 2 ) l f(r/6 2 ; u,b,,A n> B n ) 

V=0 u=0 


p (N -j)/2 

fj ^ ^ O 

+ l (r/6 ) £ f(r/6 ; u,b ,A 0> B Q ) 

j u=0 J 


where 


V 1 ' 


l 0 ! u n _1_u f U ~ 1 u -1 

f(rju,d,A ,B ) - l u J (-log r> I ( 

V 0 u Iyj^-0 1 




Y.-l ^l -1 (Y,-l-Y ? ) 

x x r A A 

I t 2 a o 


u„ P 1 


B * (s-d) ® H r {-|-( N .. + —- i)} at s * d 
U , ^ i p. 


where d is a pole of order u^, (the upper 
limit +1, for u in the summation in the 
theorem) of the product of the gamma functions 
defined by Bq . 


“SI 108 ®0 ’ " - ° 


2 2 2 a aa 

C is the constant defined in (4.3.1) and 6 = ^^2 • Ej and 

jlf ^ 

denote the summations over all j e {3,5,...,p^} and j z {4,6,...,p^ ) 
* * 

respectively, p^ = p^ if p^ is odd and p^ = p^-1 if p^ is even; 
** 

p^ = if p^ is even and p^-1 if P-^ is odd. 


Proof . The proof follows by noting (4.3.5) and combining Lemma 4.3.1 
above with Lemma 1 of Mathai (1973). 

Finally, a convenient computational form of the p.d.f. of R is 
obtained from the following theorem proved as Theorem 1 in Mathai (1973). 

Theorem M2 . H(z) is given in (4.3.5) where, 


(4.3.10) Rj - 


(b.4v <j ^ )/6. 

+ Z 1 m ^1 + 1 




1 l 

r=0 


J 


x (-log z) 

r r 1 

* l 

r *0 


- 1_r 

1 m 


V 1 


r r-1 

y 

r-l 

1 

r r ° 

^ r l- 

v •••] 



(r-l-r 1 ) 


where the C^s and D^s are defined in (4.23) and (4.24) of Mathai (1973) 
We note that the sets . } are not needed for h < j in 

II • • • J 

J 1 J m 

(4.23) and (4.24) of Mathai. Thus, Lemma 4.3.1 gives us all the desired 
sets needed to use Theorem M2 above, which expresses H(z) in terms of 
the convenient computable functions, e.g. the Psi and the generalized 
zeta functions. Examples of computation of H(z) is given in Section 5 
of Mathai (1973). Also computational procedure and computer programs 
for calculating the percentage points of the distribution of R can be 
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obtained in a manner similar to Mathai and Katiyar (1979). The null 

2 2 2 

distribution is obtained by putting 6 = ^]/^2 = ^ * n ^*3*1) and the 

nonnull distribution by substituting the specified value, under the given 
2 2 2 

alternative, 5 = A^/A ^ in (4.3.1). It is known that for p.^ * 1 or 2 , 

2 2 2 2 

X j " P j n j u j ^j where n j = Nj " 1 and n j u j = N j d j * J * 1 » 2 is 

2 

distributed as ax with d.f. p.(n .-p.+l) . Hence if p. = 1 or 2 , 

J j J i 

p^ not necessarily equal to p^ , the exact distribution of R , under 
both the null and alternative hypotheses are obtained as central F,. r 

distributions, with obvious multipliers, having d.f. given by 
= P i (n 1 ~P i +l) , i ■ 1,2 . 


4.4. Exact Distribution of U • We consider a Bartlett type modi- 

2 2 2 
fication for n . Let X i * p^^ n A u^A^ where n^ - 1 and ^ 

2 

= Nf df , i * l,...,k . As in the univariate case, we propose the 
modified test statistic 


, k , , "i^Vi// k . /k \ 


(4.4.1) 


a 1 nx 

D 


i 1 / ih 


where 


b i “ (Vi^Vi* • c b “ 




For P, - 1 or 2 , 
representation (4.4.1) we get 


Using this result and the 
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p j 


Theorem A.4.1 . For = 1 or 2 , 
i^j,i,j * 1.k the exact density of 


not necessarily equal to 
is given by 

O 


f(t) - [rw/n «» 3 >] ( J »* J ) 

• £(2ir) (k ~ 1)/2 / r((k-l)/2)j t m-1 (-logt) 


(k-3)/2 


n m D K (t) , 

ID) 3 ) D 


o < t < 1 


where, 


m = J p. (n -p +l)/2, a « P (n -P +l)/2m, j=l,...,l 
ill j J J j 


bj , j=l,...,k are defined in (A.4. ) 

and n , is defined in Theorem 2 of Ghao and Glaser (1978). 

m,a,b 

Percentage points and approximations to the above distribution are 

obtained from Dyer and Keating (1980). For p i > 3, the distributions of 
2 2 

n or n seem to be complicated. 

B 


5. Large Sample Approximations. Some large sample approximations to 
the exact distributions of the test criteria considered above are now sug¬ 
gested. Existing approximations are also reviewed for the distribution of 
GV and SGV. 


2 2 

5.1 Asymptotic Distributions of GV and SGV. Letting nu = Nd , 
n = N -1, we have from Anderson (1958), Theorem 7.5.4, that for 
large N, 

(u 2p /A 2p -1) i N(0,2p). 

P 

It is know that n = N P v = |s|/jz| is distributed as jEi*N-i 
2 

where the x 's are all independent. Hoel (1937) suggested approximating 

1/P 

the distribution of i) - w by the distribution with the density function 


27 



J p(N-p) 4 p(N-p) - 1 -Cw 


r[i/2 P (n-p)] 


where, 


C = C (p ,N) t 1 - ] 

This turns out to be exact for p=l and p=2. 

Gnanadesikan and Gupta (1970) have suggested approximating the distri- 
1 1 ^ 2 

bution of lnw= — In n= — V In x , using the central limit Theorem, by 
P P L N-i 

the normal distribution. 

We now propose a new approximation to the distribution of SGV. An ap¬ 
plication of the general result of Madansky and Olkin (1969) with 
i/p 

h(v) -Jv| shows that, for large N, 

./N(d 2 /A 2 - 1) -* N(0,2/p). 

In the light of this approximation to the distribution of the SGV, it 
is interesting to note the approximation to the distribution of GV by Anderson 
stated at the beginning. Little is known about the relative performances 
of the four approximations discussed above. 

5.2 Asymptotic Distributions of R and n. Letting c^ = 0(1^,of 
Hoel's approximation, for large N 1 and N , the density of R can be ap¬ 
proximated by that of, 


where 5 2 « A 2 /& 2 ' 


C N p (N -p ) - 

2 2 11 1 6 F 

Q iWW VW-WV- 


The null and non-null distributions are obtained 


by putting 6 ”1 and the specified value under the alternative hypothesis, 
respectively. 


1 





2 

In addition to the usual x approximation to the likelihood ratio 
2 

criterion n , another approximation is presented here. If is large com- 
2 

pared to p_^, i=l,...,k then in the same lines of Hoel's approximation, we get 
2 2 2 

Xi = can approximated by a x variable with d.f. p^(N^-p^) , 

i=l,...,k. Hence, 

2 

Lemma 5.2.1 . If is large compared to p ,i=l,...,k, then the density 

2 

of n under Ho can be approximated by f(t) defined in Theorem 4.4.1 (where 
B 

p^s now can be any integers, not necessarily Is or 2s only). 

2 

Similar result is seen to hold for n also. These approximations are 

2 

expected to be better than the usual x approximations. 

2 

6. A Multivariate F Criterion A simpler statistic than n is now 
_ max _. 

suggested for the special case when we have an equal number of observations, 
N, from k populations, each of equal dimension p, e.g.. 


F 

p ,max 


= d 2 / d 2 
max 


min 


For p=l, this coincides with the F proposed by Hartley (1950) as a 

2 

short cut method for the univariate case. It is known that In x v , for large v, 

2 

is approximately normal with variance -2/(v-l). Hence,In d is approximately 

percentage points of F can be determined from 

IH3X 


normal, for large N, with variance 


_ 1 I 2 _ 

" P 2 j-i ^ 


Thus the approximate 


F 

p,max 


(a) 



where r^fa) is the lOOaX point of the range r, in independent normal 
samples of size k. Tabulated values of t^(a) are available from Pearson 
and Hartley (1966). 

7. Example. The following example is taken from Gnanadesikan and 
Gupta (1970) who were interested in a ranking and selection procedure 
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« 


based on generalized variance. They considered 5(=p) - diminesional sum¬ 
maries of speech spectrographic data from a talker identification problem. 
The data consisted of 7(=N) replicate utterances of 10(=k) words for one 
particular speaker. Then, 

1/5 

F = (720616.4465/1.5411) = 13.6137 

p,max 

and F (.01) = 9.0737 

p,max 

Hence, the hypothesis of equal multidimensional scatter, as measured 
by SGV, is to be rejected. 

8. Unbiasedness of the modified LRT's for and • 

The results in this section show that tests for SGV's possess the 
property of unbiasedness as do the corresponding tests for covariance 
matrices. We will consider the modified LRT's obtained by replacing the 
sample size by the degrees of freedom n^ = JL - 1 in the original 

LRT, and unbiasedness of these tests for and for Hq 2 in the case 

of equidimensional vector variables will be established. The proofs for 
the covariance matrices were given by Sugiura and Nagao (1968). 

8,1, Unbiasedness of modified LRT for . 

Theorem 8.1.1 . For testing Hqj:|Z| = Oq P against the alternative 
H^^:|Z| 4 cfgP for unknown y , the modified LRT given by replacing N 
by n * N - 1 in the original LRT is unbiased. 

Proof : For given > 0 , let b^ = {Z: |Z| = OQ P ,Zp.d.} , 

S 1 ‘ «' l £ l * ^ P . Ep- d ->. 

Then 


To prove Th. 8.1.1 , it is then enough to show that for any Z fc b^ , say 
Iq fc Sq and for any Z fc , say Z^ fc b^ , 
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(8.1.1) P(w|H 01 ,E 0 ) - P(a)|H n ,Z 1 ) < 0 


where is the critical region of the modified LRT, i.e.. 


(8.1.2) a): S/S is p.d. and 


-Np/2 


expj-p/2 IS1 /cTq P ] | 

<*] 


>-Np/2 


.3) =w:js|s is p.d. and [|] | z -l s |(N-l)/2 exp j--P| Z^S | 1/p J 

< c a VZ 0 fc S 0 ’ 1- *” l £ ol*°0 P J- 


This is so because if (8.1.1) is true for any arbitrary (Z ,E ) fc (S^&p , 

then it is also true for any arbitrary (E^.E*) fc (Sq,S^) or (Z^.Z^) 

fc (fep,^) and, hence, this will imply P(u)/H 01 ) - P(uj,H^) £ 0 . 

Let gfcO(p), the multiplication group of pxp orthogonal matrices, be such 

that g is a diagonal matrix T where E^^ 2 = (^q^ 2 ) * 

1/2 1/2 

Zq Zq = Z Q . Without any loss of generality we can assume that 

Eq = I and Z^ = T , the diagonal matrix where diagonal elements are the 

- 1/2 - 1/2 

characteristic roots of Z^ E^Eq * Hence under H^,S he® a Wishart 
distribution with parameters T and n = N - 1 . Now 


P(w|H 11 ) = c j s j(N-p-2)/2 j r |-(N-l)/2 exp {_ I tr f^sMs 


c |u| CN-p-2)/2 i tj . u}du 

,* n.P 2 


where 


c -1 - TT P(P " 1)/4 2 np/2 n T((n-i+1)/2) , 

n >P -t.i 


u - r ' 1/2 sr _1/ ^ 2 


u>' - {U: U p.d., r 1/2 ur 1/2 fc 0)} . 
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Also 


If I - |rr (p+1,/2 


Since a)' «* u> when is true and in the region w , letting 

0^,i = l,...,p be the characteristic root of U, 


f(U) = |U| 


(N-p-2)/2 exp( _ 1 tr 0) = ^ 


(N-p-2) /2 exp( _ 1 ZQ j 


we get 


Further, 


< c' U 
— a ' 


< c' 
— at 


~(p+l)/2 
Np/2 


r \ Np/2 r ( ^0. , 

(f) 


uf (p+1)/2 - g( U ) , 


| y | (N-P-2) l tr y^y 


OJ-wnai 


exists and 


g(U)dU < «> 


ujnu) 


w-uinai 


, f (U)dU _< 


oi-oinw 


, g(U)dU 


and 


Thus, we get 


[ * , f(U)dU < - [ , n * g(U)dU 

J cj^-ojnto 1 7 — J co -oiHcj* 


P(u)|H 01 ,E 0 ) - P(wiH 11 ,E 1 ) 


-cjnw* 

)Np/2| 


-c /f 

n,p|Jw- 




f(U)dU 




co^—wncu* 


,}i»i 


-(p+l>/2 


dU 
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- v 4 r { w >'-'-' 2 


The last inequality is due to the fact that |u| is the invariant 

measure in the space of U under the full linear group G^(p) transform¬ 


ing U -*• gUg' , g fc G^(p) , that is. 


, u j-(p+l)/2 du 


i-(p+D/2, 


| u Ju|'' r -d U. Q.E.D. 

Precisely this property was exploited by Sugiura and Nagao to prove the 

unbiasedness of the modified LRT's for £ = £ Q and ^ « £ 

Theorem 8.1.1 can be generalized to the k-sample case. Let x^: p*l 

1-1,...,N (N >p) be a random sample from N (p,£.), j=l,2,...,k . Let 
3 3 P J 

S be the sample sums of products matrix and n. = N.-l . Using the same 
3 3 3 

argument as in the proof of Theorem 6.1.1 we have the following theorem. 

Theorem 8.1.2 . For testing the hypothesis : 12^. [ = 0 ^. 

(j” 1 -, 2 , • • • > k ) against the alternatives + Oq for some i , 

2 

where the mean VL is unspecified and <7^ > 0 is given, the modified LRT 
having the critical region 


u)’: 


(S^,•••,S^)/S. is p.d. (j—1,••.k) 


and 


k 

n 

j-i 


ft 


ft p/2 r 




V 2 


exp < 


1 1 > 
1 s | 

I/P'1 


>< c" 


/ v c 

1 a 

Oj 

) J 


is unbiased. 
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Consider further the problem of testing ^ against 


H" :|Z| 1/P > 0? 


^ where 0 < (^ < . As a modification of the LRT in 


Section 3.1 consider the test 4 >q, 


4>n : Re J ect H" iff ! s l /o n P > c n 


Theorem .1.3 . For testing against , the test <}) is the 
uniformly most powerful invariant. Also (J>^ is a maximum test and is most 
stringent. 

Proof . See proof of Proposition 1 of Eaton (1967). 


8.2. Unbiasedness of the modified LRT for [ | = | | . Let 

^ N p< U l ,Z l ) ’ i,Bl »*** ,H l and ~j V M 2 ,E 2 )’ j = l»---» N 2 denote two 
independent random samples from two independent multivariate normal popula¬ 
tions. For testing H Q2 : ] | 1/p = |E 2 | 1/p (or equivalently 1^1" 
against H 12 : |£ t l^l 1 ^ » the critical region of LRT is given by, 

from (3. 2 .1), as 


< c' , a constant 


f 2 1 


r 2) 

K 


ill 


(8. 2 .1) w - \( s i» S 2 ): 2 >(N 1 +N 2 )p/ 2 < C d 


S l +S 2 

V N 2 


where s^ K ■ |S^|, , i«l, 2 are the sample sums of products matrices 

for X and Y respectively. 


Theorem 8. 2 .1 . The modified LRT for H^ 2 |Z^| 


|Z 2 | against 


H l 2 : I^* 1 1 ^ 1^1 obtained by replacing by n^ « Nj-1, i“l, 2 in 

(8. 2 .1) is unbiased. 


Proof . As in Section 8.1, let S q » {Z:(E 1 ,E 2 >:|zj = |z 2 | ^.Z^.d.} 
and S x - {E:(E 1> E 2 ):|E 1 | ^ |E 2 |;E lt E 2 p.d.} . Then, SJLS^ 4> . To prove 
Th. 8.2.1 it suffices to show that for any ZeS Q , say Z q , and for any 
ZeS^, say Z^, 

p (u|h o2 ,e o ) - p(<o|h 12 ,z 1 ) < 0 . 


Consider any Z ■ (Z^,Z 2 ) . 

Without loss of generality, we can take m I and Z^ *» 8 , the diagonal 
matrix with diagonal elements 8^.8^ . 


P(o)|H 12 ) = c r 


‘1* 


c 

P n 2 ,p 


(S lf S 2 )fea) 


! s i 


(n 1 -p-l)/2 


|S 5 


(n 2 -p-l)/2 


|e| 


-n x /2 


exp 


H 


tr(0 _1 S 1 +S 2 )> 


dS x dS 2 


c c 

n^,p n 2’P 


<n-p-l>/2 (n,-p-l)/2 

lD r | 10,1 2 


(i,u 2 )eu> 


~ n i ^ f i 1 

je j 1 exp|- J tr(6 J -+U 2 )U 1 VdU 1 dU 2 


(I,U 2 )to) 


(n,-p-l)/2 -n./2 , /, 

|u 2 | 2 |e! |(e ^upl n/2 du 2 


where ■ U^, S 2 * uJ /2 u 2 oJ /2 

that U * U^ 2 !^ 2 and b ■ c 
111 n 

Jacobian of the transformation 


1/2 

with U^' a symmetric matrix such 

c /c , n - n.+n. . The 
x ,p n 2 ,p n,p 1 2 

(Sj^.Sj) -*• (U X ,U 2 ) is given by 
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3(S 1 ,S 2 ) . ^ i(p+l)/2 

3<U lf U 2 ) |U 1' 

1/2 1/2 

Put V ■ 0 0 7 . Let Li* be the set of all pxp positive definite 

- 1/2 - 1/2 

matrices V such that (1,8 V 0 )fc go , and oj be the set of all 
pxp positive definite symmetric matrices V such that (1,V)fc u) . Then, 


P(w|H 02 ) - P(u|H 19 ) - b 


12 " 


Li 


Li* 


(n -p-l)/2 , 

|V| 2 |(I+V)| /2 dV 


lvl” 2/2 1| v f<P + « /2 dv . 

Now consider the following. 

Lemma . If and S 2 are two positive definite matrices of the 

same dimension, p*p , then 


= b< 


Li-LiHLi* 


CL)*-wna)* 






Proof 1 Since and S 2 are two p.d. matrices, there exists a 

nonsingular matrix M such that MS^M' * I and MS 2 M' ■ A “ diagonal 
where <5^ > 0, i*l,...,p are the characteristic roots of 
S 2 S~* . To prove the lemma, it is enough to show 


1 


^Professor Olkin has pointed out that this can be proven alternatively 
through the concavity of |s|^**. 
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(3) 


m 


{|MS M' | 1/p +|MS 2 M'f L/p } P < |MS 1 M’+MS 2 M' 
i.e. {|l| 1/p +|Al 1/p } P < {|I+A|} 


L.e. 11 + n 6 1/p > < n (1+6 ) 

l 1=1 1 > 1=1 1 


1/p 


which is ttrue by Holder's Inequality. Hence the lemma. 
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(by the Lemma) 

| v |-(p+l)/2 (Jv < . q.e.D. 
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9. Remarks . As with any multidimensional measure, the SGV cannot 
be expected to be the unique measure best for all situations of multi¬ 
dimensional scatter. However, if we are interested in 'overall' scatter 
and where magnitude of individual variances separately are not of great con¬ 
cern, the SGV can be expected to perform adequately. Wilks (1967) gives an 
expository account of GV as a measure of multidimensional scatter from 
geometrical standpoint. 

This paper suggests several topics for future research. Admissibility 
of the LRTs may be studied. Also, alternative test procedures to LRTs, e.g., 
by the Union-Intersection method will be interesting. Sequential and non- 
parametric test procedures for SGVs may provide further insights into the 
problem. The case of one sided alternatives and singular dispersion matrices 
seem to be of great practical importance and interest. Finally, one can 
explore situations where SGVs may not be adequate and for such situations 
provide adequate measures and tests for multidimensional scatter. 
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